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Motivated by claims about the nature of the observed timescales in protein systems said to fold 
"downhill" , we have studied the finite, linear master equation 

Pn - kfPn-1 + kuPn + 1 ~ (kf + k u )p n 

which is a model of the downhill process. By solving for the system eigenvalues, we prove the often 
stated claim that in situations where there is no free energy barrier, a transition between single 
and multi-exponential kinetics occurs at sufficient bias (towards the native state). Consequences for 
protein folding, especially the downhill folding scenario, are briefly discussed. 



INTRODUCTION 

In studies of protein folding, it is often claimed that at 
sufficient native bias we can expect the system kinetics to 
become multi-exponential or "downhill" [IH3]. While a 
relatively simple statement, it is often presented without 
proof nor even heuristic justification. 

There are many reasons why such a statement is not 
accompanied by evidence. Foremost is the fact that pro- 
tein folding is often modeled by one dimensional reaction 
coordinates, on which dynamics are governed by a Smolu- 
chowski equation. Even in situations where such a model 
is appropriate, in the continuous limit it is very hard to 
determine the system's timescale spectrum. What one 
is truly interested in are the eigenvalues of the master 
equation, which is the Smoluchowski representation are 
blurred into a continuous spectral density by the limiting 
procedure used to transform a discrete master equation 
into a continuous propagator [4 . 

This blurring is often mirrored in experimental studies, 
where kinetic traces, A(t), have sometimes been fit to 
stretched exponentials 

A(t) = A e- at " (1) 

while such a model has few free parameters (a, the 
underlying phenomena leading to such a kinetic response 
is typically complex (see e.g. the classic work [5]). If 
the underlying physics of the system is stationary and 
Markovian, then we expect any kinetic response to be 
composed of elementary processes that decay as expo- 
nentials 

A{t) = Y i A i e~ a < t (2) 

i 

One can readily verify by Taylor expansion that such a se- 
ries can reproduce any monotonically decreasing function 




FIG. 1. Illustration of the one-step process under consider- 
ation. The model consists of a linear sequence of N states, 
with a forward rate kf and backward rate kb. 

given a suitable choice of positive and Ai. Specifically, 
if there are many closely spaced exponentials a stretched 
exponential form can arise. The model presented here ex- 
plains one mechanism by which a stretched exponential 
could emerge. 

It should be noted that in protein folding, most 
stretched exponentials are equally well fit by a summa- 
tion of two or more elementary exponentials [6J [7], and 
thus researchers have a choice for how to fit their data. 
While the stretched exponential form may be more conve- 
nient for fitting data, it is phenomenological; the summa- 
tion of exponentials is microscopic. In some sense, then, 
the stretched exponential form is less appealing, because 
it leaves out a connection to the microscopic physics de- 
scribing the system. This is why models like the one pre- 
sented here are useful - they represent an intermediate 
step in connecting phenomenology and theory. 

Since multi-exponential kinetics have been observed 
in a number of experimental studies of protein folding 
[6j [9] , understanding the origins of this behavior is an 
imperative goal for theory. This is especially true since 
the experimental community has argued over what ex- 
perimental observations are sufficient to claim downhill 
behavior [T0HT2] . Moreover, downhill behavior is con- 
sidered a key prediction of theory, though the original 
theories describing protein folding were ambiguous as to 
whether or not a large free-energy bias resulted in multi- 
exponential kinetics [2]. 
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FIG. 2. The one-step model's relaxation spectrum can ac- 
count for a stretched exponential. Plotted in blue is a 
stretched exponential with a = 1, f3 — 0.75 and in black 
vertical lines, the weighted sum of exponential decays ([5]), 
with <y.i from the eigenvalues of the one-step model. Here 
kf/ku = 1.4 and N = 100. Weights A{ were chosen to match 
the stretched exponential. 

Some authors working in folding have been able to 
work around this difficulty, e.g. Bicout and Szabo showed 
that there were discrete exponential timescales in the 
mean first passage time distribution to the native state 
PP. Such a treatment, however, cannot account for ergod- 
icity in a systematic manner and therefore suffers from 
some limitations. 

Here we present proof that there is a transition from 
single to multi-exponential kinetics as a system moves 
from no bias to a heavy bias towards a native state. We 
do this by calculating the eigenvalues of a finite, homo- 
geneous, linear one-step process 

Pn = kfp n -! + k u p n +i - (kf + k u )p n 

illustrated in Figure [I] In this model, we consider the 
transitions of an ensemble of walkers (proteins) between 
a series of N states over time. In each state, the walk- 
ers move to "forward" (more native) at a rate of fcj, and 
backwards (less native) with rate k^. Our central result 
is a closed-form expression for the eigenvalues of the mas- 
ter equation described, which give the rates of relaxation 
that would be measured in a bulk experiment performed 
on such a system. 

This process is a discrete analog to the one-dimensional 
reaction coordinate picture of protein folding, where in 
the downhill scenario the free energy decreases mono- 
tonically as one approaches the native state. The pre- 
sented model has a discrete timescale spectrum, as ob- 
served experimentally in studies of protein folding. Fur- 
ther, the discrete spectrum is fine enough to describe a 
stretched exponential (Fig. [2| . By scaling the the fold- 
ing bias, we show that there is a clear transition from 
single-exponential kinetics at low biases (flat /golf-course 




FIG. 3. Illustration of the analogy between the model consid- 
ered and "downhill" free energy profiles. Top: The timescale 
spectra at various levels of bias, showing the transition from 
a large timescale separation at low bias to a more finely space 
spectrum. Bottom: The free energy profiles that correspond 
to these spectra, labels are the value of bias kf/kb. 



profiles) to multi-exponential kinetics at large biases. 

Here, we ignore the larger (and more interesting) ques- 
tions of whether or not a linear model such as the one 
presented is a good model of folding. Instead, we wish 
to simply show that a sufficient native bias does lead to 
multi-exponential kinetics, while in lieu of any bias one 
obtains single exponential behavior. We hope that the 
analysis presented here will help clean up some of the 
confusion for under what conditions we can expect such 
behavior. 

As a final note, while the model presented is quite sim- 
ple, and should be common in fields other than protein 
folding we have not been able to find a clear solution pre- 
sented in the literature. We found this quite surprising 
considering this model may have wide applicability. We 
present the diagonalization of an important class of tridi- 
agonal matrices that we expect appears in many fields. 
Note that solutions for similar models have been pub- 
lished, but not this exact one p~3j [14]. Hopefully the 
solution presented here can be used in other applications. 

First, we present the mathematical description of the 
model and its solution. We proceed to discuss interest- 
ing aspects of this solution. The reader uninterested in 
mathematical detail can stop there - the rest of the paper 
is devoted to a proof of the solution. 
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SOLUTION AND ANALYSIS OF THE MODEL 

The one-step model just described is equivalent to the 
master equation 



dP 
~dt 



= KP 



where P is an TV-vector of the state populations, and K 
is a rate matrix of the form 



K 



-k f k f 
h -(k f + k b ) k f 











h -(kf + k b ) kf 
k b -k b 



In what follows, we will show that the eigenvalues of 
this matrix are 



At = 







-{k f + /c 6 ) + 2^/kJF b cos 



k 
k 



1 

2,. 



..,7V 



(3) 



where N is the size of K, and k indexes the eigenvalues. 
This formula shows the eigenvalues of such a matrix are 
equally spaced at intervals of 2^/kfk b . 

Using this equation, we can easily calculate the condi- 
tions for single versus multi-exponential behavior. These 
eigenvalues are rates of relaxation, and are directly 
related to the characteristic relaxation timescales of the 
system 



Tk 



1 



looking at the spacing of these timescales tells us 
whether or not the system would appear single or multi- 
exponential to an experiment (Fig. [3]). If there is a large 
gap between the first (T2) and second (ts) timescales, 
then the system is "single" exponential, while if the gap is 
small the system is multi-exponential. Note that t\ = 00, 
and represents the stationary solution, such that there 
will always be N — 1 observable timescales, regardless 
of whether we talk about the dynamics being single or 
multi-exponential. 

To help quantify the two regimes of interest, we intro- 
duce the kinetic isolation, AT23/V2, where AT23 =72 — 73- 
This value provides a normalized measure between and 
1 of the size of the separation between the first and second 
observable timescales. With Eq. ([3| in hand, the calcu- 
lation of the kinetic isolation is trivial - Figure [4] shows 
the kinetic isolation as a function of increasing bias. The 
figure clearly shows that around kf/k b « 1.2, a relatively 
small bias, there is a dramatic shift from single exponen- 
tial behavior (large kinetic isolation) to multi-exponential 
behavior (small isolation). 




FIG. 4. The kinetic isolation AT23/T2 of the model as a func- 
tion of the ratio of the forward to backward rate, kf/kb. The 
isolation is normalized by the longest timescale T2 . There is a 
clear transition from single to multi-exponential behavior at 
around kf/kb ~ 1.2 



This figure represents our central result, which is sim- 
ply verification of the claim that as a one-dimensional 
model moves from a "golf-course", or unbiased free en- 
ergy landscape, the kinetics shift from single to multi- 
exponential. 



SOLUTION OF THE MODEL 

What follows is simply the mathematical justification 
for Equation [3J the closed- form expression for the eigen- 
values of K. 

Laplace's expansion in minors gives the characteristic 
polynomial of a matrix recursively in terms of the minors 
of the matrix. For generic tridiagonal matrix A = (a^-), 
it can be shown that the the expansion for the charac- 
teristic equation takes the simplified form 



\A-n\ — Q"n,n\A n — \\ 



a, 



n— l,nQ"n,n— 1 



(4) 



with 



\A \ 1 and \Ai \ = a x 



where | • | denotes a matrix determinant and Ai indicates 
the i th principle minor of A, that is the matrix formed 
by taking the first i rows and columns of A. 

We begin by re- writing K for notational convenience. 
Let 



—a 



a 

-(a + b) a 







b —(a + b) a 
b -b 



4 



with a = ^kf/k u and b = ^/k u /kf, for reasons that 
will be clear in a moment. By this definition, K = 
(kfk u )~ 1 / 2 A, and also ab = 1, which will be a key later. 

Let's quickly sketch our calculation strategy. We will 
show A is similar to the matrix 



where 



A' = 



-(a + b) a 

b — (a + b) a 



-(a + b) 
b 



and then calculate the eigenvalues of this matrix, which 
is a much easier task than directly finding the eigenvalues 
of A. We then scale the eigenvalues of A to recover the 
eigenvalues of K. 



Transformation of A to A' 

To show that A is similar to A' , we find a transforma- 
tion P such that 

A' = P~ X AP 



We assert that the matrix 



1 1 1 
1 1 



with inverse 



P- 1 



1 -1 
1 -1 



1 -1 
1 



satisfies this requirement. That A! = P _1 AP is readily 
verified via a tedious but simple calculation. Similarly, 
showing P~ X P = I is trivial but tedious. A more elegant 
method for the verification of these facts surely exists, 
but has not been given much effort since the brute-force 
method is quite effective. 



The First Eigenvalue of A! 

Now, let us find the eigenvalues of A' . First, apply Eq. 
Q once to A' — XI to obtain 



N-l 



-(a + b) a 

b —(a + b) a 



-(a + b) 



this shows that the first eigenvalue of A is A = 0, and 
that the rest of the eigenvalues can be found from the 
roots of the characteristic polynomial of A' N _ 1 — XI. Let 
us find these roots. 



Characteristic Polynomial of A' N _ 1 

We will now find the characteristic polynomial of 
A' N _ 1 — XI. For notational ease, set 

t = -(a + b) - X 

which are the diagonal elements of A' N _ 1 — XI. 

From Eq. Q, the polynomial of A' N _ 1 — XI can be 
clearly seen to oe given by the recurrence relation 

P n = tP n -\ — P n -2 

to solve this, introduce the characteristic equation 

x 2 - tx + 1 = 
which has two (complex) roots 



t . .V4^¥ 

x = - ± i 

2 2 

which, when recast into polar form such that x = e ±ze 
can be written 



t 



cost 



suit 



V4^2 



implying 



tan# 



V4^2 



The general solution for the recurrence relation is 

P n = acos(m6) + j3 sm(m6) 

with m the being the size of A' N _^ in this case N — 1. 
We will retain m for the moment for conciseness. 

Now let us find the coefficients a and (3. From the 
starting conditions of Eq. Q , we have 



a = 1 



and 



A' — XI = -X\A' N _ 1 - XI\ 



Pi = acos(m0) + f3sm(m9) = t 
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substitute our expressions for a, cos# and sin# to get 



f3= V4 12 =tanfl 



such that our final solution is 
P n = cos(mO) 
with explicitly given by 

= cos - 



sin(ra#) 
tan# 



a + 6 + A 



Converting the Roots into Eigenvalues of K 

Our aim is to find the zeros of P n , which are the eigen- 
values of interest. To do this, rearrange Eq. (§ to get 

sin cos(ra#) + cos sin(ra0) = 

which by a common identity is 

sin(0 + mO) = sin(0 + (N - 1)0) = 

where we have re-substituted m = N—l. This is satisfied 
when NO = jn, with j one of the integers. 

These values of our the roots of interest, but we want 
them in terms of A. To convert them, recall that before 
we had cos# = t/2 and t = —(a + b) — A. Combine these 
with the expression from to obtain 



A^ = — (a + b) + 2 cos 



J7T 

7V 



where a subscript A indicates these are the eigenvalues 
of A' (and therefore A). In this form, it is clear that 
we must restrict the values of j to obtain N—l unique 
eigenvalues. Due to periodicity, there are many choices 
that will suffice, but the integers k = 2,3, ...,7V will be 
natural, since then k is an appropriate eigenvalue index 
(recall we must include Ai = as the first eigenvalue). 



To convert the A^ into the eigenvalues of K, we need 
to scale them by (kfku) 1 / 2 . Substitute for a and 6, and 
multiply by this scaling factor to obtain 



A (k f k u ) 



1/2 



-\Jkf/k u - ^k u /k f 



2 cos 



(5) which reduces to (J3|, our stated solution. 
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